DNA sequence, and recombinant preparation
of group 4 major allergens from cereals



Background of the invention 



The present invention relates to the provision of DNA sequences of group 4
major allergens from cereals (Triticeae). The invention also encompasses
fragments, new combinations of partial sequences and point mutants having
a hypoallergenic action. The recombinant DNA molecules and the derived
polypeptides, fragments, new combinations of partial sequences and variants
can be utilised for the therapy of pollen-allergic diseases. The proteins
prepared by recombinant methods can be employed for in vitro and in vivo
diagnosis of pollen allergies.



Type 1 allergies are of importance worldwide. Up to 20% of the population
in industrialised countries suffer from complaints such as allergic rhinitis,
conjunctivitis or bronchial asthma. These allergies are caused by allergens
present in the air (aeroallergens) which are released by sources of various
origin, such as plant pollen, mites, cats or dogs. Up to 40% of these type 1
allergy sufferers in turn exhibit specific IgE reactivity with grass pollen
allergens, inter alia cereal pollen allergens (Freidhoff et al., 1986, J. Allergy
Clin. Immunol. 78, 1190-1201). Of the cereal pollen allergens, the allergens
of rye have particular importance.

The substances which trigger type 1 allergy are proteins, glycoproteins or
polypeptides. After uptake via the mucous membranes, these allergens
react with the IgE molecules bound to the surface of mast cells in sensitised
individuals. If two IgE molecules are crosslinked to one another by an
allergen, this results in the release of mediators (for example histamine,
prostaglandins) and cytokines by the effector cell and thus in the
corresponding clinical symptoms.



A distinction is made between major and minor allergens, depending on the
relative frequency with which the individual allergen molecules react with
the IgE antibodies of allergy sufferers.

The allergens from the pollen of various species from the family of the
grasses (Poaceae) are divided into groups which are homologous amongst
one another.

In particular, the molecules of major allergen group 4 have high
immunological cross-reactivity with one another both with monoclonal murine
antibodies and also with human IgE antibodies (Fahlbusch et al., 1993, Clin.
Exp. Allergy 23:51-60; Leduc-Brodard et al., 1996, J. Allergy Clin. Immunol.
98:1065-1072; Su et al., 1996, J. Allergy Clin. Immunol. 97:210; Fahlbusch
et al., 1998, Clin. Exp. Allergy 28:799-807; Gavrovic-Jankulovic et al., 2000,
Invest. Allergol. Clin. Immunol. 10 (6):361-367; Stumvoll et al., 2002, Biol.
Chem. 383:1383-1396; Grote et al., 2002, Biol. Chem. 383:1441-1445;
Andersson and Lidholm, 2003, Int. Arch. Allergy Immunol. 130:87-107;
Mari, 2003, Clin. Exp. Allergy 33 (1):43-51).

A complete DNA sequence is hitherto not known for any of the group 4
major allergens.



From the group 4 allergen from Dactylis glomerata, it has hitherto only
been possible to obtain peptides by enzymatic degradation and to sequence
them:
DIYNYMEPYVSK,
VDPTDYFGNEQ,
ARTAWVDSGAQLGELSY
and GVLFNIQYVNYWFAP (Leduc-Brodard et al., 1996, J. Allergy Clin.
Immunol. 98: 1065-1072).

Peptides have also been obtained from the group 4 allergen of sub-tropical
Bermuda grass (Cynodon dactylon) by proteolysis and sequenced:
KTVKPLYIITP,
KQVERDFLTSLTKDIPQLYLKS,
TVKPLYIITPITAAMI,
LRKYGTAADNVIDAKWDAQGRLL,
KWQTVAPALPDPNM,
VTWIESVPYIPMGDK,
GTVRDLLXRTSNIKAFGKY,
TSNIKAFGKYKSDYVLEPIPKKS,
YRDLDLGVNQWG,
SATPPTHRSGVLFNI
and AAAALPTQVTRDIYAFMTPYVSKNPRQAYVNYRDLD (Liaw et al.,
2001, Biochem. Biophys. Research Communication 280: 738-743).






For Lolium perenne, peptide fragments having the following sequences 
have been described for the basic group 4 allergen: FLEPVLGLIFPAGV 
and GLIEFPAGV (Jaggi et al., 1989, Int. Arch. Allergy Appl. Immunol. 89: 
342-348). 



As the first sequence of a group 4 allergen, the still unpublished sequence
of Phl p 4 from Phleum pratense has been elucidated by the inventors of
the present patent application and described in International Application
PCT/EP03/06092.



Nothing is hitherto known about the sequences of the group 4 major allergens
from cereals (Triticeae).






The object on which the present invention was based therefore consisted in
the provision of DNA sequences of group 4 major allergens from cereals, in
particular the allergen Sec c 4 from rye (Secale cereale), Hor v 4 from barley
(Hordeum vulgare) and Tri a 4 from wheat (Triticum aestivum), and of
corresponding recombinant DNA molecules on the basis of which the
allergens can be expressed as protein and made available, as such or in
modified form, for pharmacologically significant exploitation. The sequence
of Phl p 4 was the starting point for the present invention.






List of sequences according to the invention 



The DNA and protein sequences of the mature allergens are preceded by a
signal sequence shown in italics and underlined. The respective stop
codons in the DNA sequences are shown underlined. The encoding region
ends with them.



Figure 1: DNA sequence of Sec c 4. (a) Isoform Sec c 4.01, (b) isoform
Sec c 4.02.



Figure 2: Protein sequences derived from the DNA sequences in
accordance with Fig. 1.



Figure 3: DNA sequence of Hor v 4. 



Figure 4: Protein sequence derived from the DNA sequence in accordance
with Fig. 3.



Figure 5: DNA sequence of Tri a 4. (a) Isoform Tri a 4.01, (b) isoform Tri a
4.02.






Figure 6: Protein sequences derived from the DNA sequences in 
accordance with Fig. 5. 
Figure 7: DNA sequence of Phl p 4, in accordance with SEQ ID NO 5 from
PCT/EP03/06092.

Figure 8: Protein sequence of Phl p 4, in accordance with SEQ ID NO 6
from PCT/EP03/06092.
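
As a rough illustration of how the protein figures relate to the DNA figures, the following Python sketch translates a reading frame with the standard genetic code. The function name `translate` and the choice of the first line of Fig. 1(a) as input are assumptions made for this example only; the sketch is not the method by which the sequences in the figures were derived.

```python
from itertools import product

# Standard genetic code, listed in TCAG order (assumption of this sketch:
# no organism-specific deviations are considered).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str, to_stop: bool = False) -> str:
    """Translate a DNA string in frame 1; an incomplete trailing codon is ignored."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unreadable codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First line of Fig. 1(a), i.e. the start of the Sec c 4.01 reading frame.
first_line = "AACTATAGGGCCTTCGCGCTGGCGCTCCTCTTCTGCGCCTTGTCCTG"
print(translate(first_line))  # NYRAFALALLFCALS
```

Codons that contain characters other than A, C, G and T are reported as `X` rather than guessed, which keeps unreadable positions visible in the translated output.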

Description of the invention 

The present invention now provides for the first time DNA sequences of the 
cereal pollen major allergens Sec c 4, Hor v 4 and Tri a 4, as shown in 
Figs. 1, 3 and 5. 

The present invention therefore relates to DNA molecules selected from 
the nucleotide sequences depicted in Figs. 1, 3 and 5. 

The invention furthermore relates to sequences homologous to the DNA 
sequences according to the invention and corresponding DNA molecules of 
group 4 allergens from other Poaceae, such as, for example, Lolium 
perenne, Dactylis glomerata, Poa pratensis, Cynodon dactylon and Holcus 
lanatus, which, owing to the sequence homology that exists, hybridise with 
the DNA sequences according to the invention under stringent conditions, 
or have immunological cross-reactivity with respect to the allergens 
according to the invention. 
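
Hybridisation under stringent conditions is established experimentally, but the underlying sequence homology can be estimated computationally. The following Python sketch is offered only as a rough proxy: it computes the percentage of identical positions for two pre-aligned stretches of equal length. The function name `percent_identity` is hypothetical, and the two 47-base stretches are corresponding lines read from Fig. 1(a) and Fig. 1(b).

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Corresponding stretches of the two Sec c 4 isoforms (Fig. 1(a) and Fig. 1(b)).
sec_c_4_01 = "CCCCCACCAACGCCTCGCACATCCAGTCCGCGGTGGTGTGCGGCCGC"
sec_c_4_02 = "CCCCCACCAACGCCTCCCACATCCAGTCCGCGGTGGTGTGCGGCCGC"
print(f"{percent_identity(sec_c_4_01, sec_c_4_02):.1f}% identical")
```

A real comparison across species would first require an alignment that allows gaps; this sketch deliberately leaves that step out.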

The invention also encompasses fragments, new combinations of partial 
sequences and point mutants having a hypoallergenic action. 

The invention therefore furthermore relates to corresponding partial
sequences, a combination of partial sequences, or replacement, elimination
or addition mutants which encode an immunomodulatory, T-cell-reactive
fragment of a group 4 allergen from the Poaceae.

With knowledge of the DNA sequence of the naturally occurring allergens,
it is now possible to prepare these allergens as recombinant proteins which
can be used in the diagnosis and therapy of allergic diseases (Scheiner
and Kraft, 1995, Allergy 50: 384-391).

A classical approach to effective therapeutic treatment of allergies is
specific immunotherapy or hyposensitisation (Fiebig, 1995, Allergo J. 4 (6):
336-339; Bousquet et al., 1998, J. Allergy Clin. Immunol. 102 (4): 558-562).
In this method, the patient is injected subcutaneously with natural allergen
extracts in increasing doses. However, there is a risk in this method of
allergic reactions or even anaphylactic shock. In order to minimise these
risks, innovative preparations in the form of allergoids are employed. These
are chemically modified allergen extracts which have significantly reduced
IgE reactivity, but identical T-cell reactivity compared with the untreated
extract (Fiebig, 1995, Allergo J. 4 (7): 377-382).

Even more substantial therapy optimisation would be possible with aller- 
gens prepared by recombinant methods. Defined cocktails of high-purity 
allergens prepared by recombinant methods, optionally matched to the 
individual sensitisation patterns of the patients, could replace extracts from 
natural allergen sources since these, in addition to the various allergens, 
contain a relatively large number of immunogenic, but non-allergenic sec- 
ondary proteins. 

Realistic perspectives which may result in reliable hyposensitisation with 
expression products are offered by specifically mutated recombinant aller- 
gens in which IgE epitopes are specifically deleted without impairing the 
T-cell epitopes which are essential for therapy (Schramm et al., 1999. 
J. Immunol. 162: 2406-2414). 

A further possibility for therapeutic influencing of the disturbed TH cell
equilibrium in allergy sufferers is immunotherapeutic DNA vaccination,
which involves treatment with expressible DNA which encodes the relevant
allergens. Initial experimental evidence of allergen-specific influencing of
the immune response has been furnished in rodents by injection of
allergen-encoding DNA (Hsu et al., 1996, Nature Medicine 2 (5): 540-544).

The present invention therefore also relates to a DNA molecule described 
above or below as medicament and to a corresponding recombinant 
expression vector as medicament. 

The corresponding proteins prepared by recombinant methods can be 
employed for therapy and for in vitro and in vivo diagnosis of pollen aller- 
gies. 

For preparation of the recombinant allergen, the cloned nucleic acid is 
ligated into an expression vector, and this construct is expressed in a suit- 
able host organism. After biochemical purification, this recombinant aller- 
gen is available for detection of IgE antibodies by established methods. 

The present invention therefore furthermore relates to a recombinant 
expression vector comprising a DNA molecule described above or below, 
functionally linked to an expression control sequence, and a host organism 
transformed with said DNA molecule or said expression vector. 

The invention also relates to the use of at least one DNA molecule de- 
scribed above or at least one expression vector described above for the 
preparation of a medicament for the immunotherapeutic DNA vaccination 
of patients with allergies in the triggering of which group 4 allergens from 
the Poaceae, preferably Triticeae, in particular Sec c 4, Hor v 4, Tri a 4, are 
involved and/or for the prevention of such allergies. 

As already stated, the invention can be used as an essential component in 
a recombinant allergen- or nucleic acid-containing preparation for specific 
immunotherapy. A number of possibilities arise here. On the one hand, the 
protein with an unchanged primary structure may be a constituent of the 
preparation. On the other hand, a hypoallergenic (allergoid) form can be
used in accordance with the invention for therapy in order to avoid unde- 
sired side effects by specific deletion of IgE epitopes of the molecule as a 
whole or the production of individual fragments which encode T-cell epi- 
topes. Finally, the nucleic acid per se, if ligated with a eukaryotic expres- 
sion vector, gives a preparation which, when applied directly, modifies the 
allergic immune state in the therapeutic sense. 

The present invention furthermore relates to the polypeptides encoded by 
one or more of the DNA molecules described above, preferably in their 
property as medicament. 

These are proteins corresponding to one of the amino acid sequences 
depicted in Figs. 2, 4 and 6. In particular, these are the mature proteins 
(without signal sequence component), beginning with amino acid 23 in the 
case of the amino acid sequences depicted in Figs. 2 and 4 and with amino 
acid 22 in the case of the amino acid sequences depicted in Fig. 6. The 
invention furthermore relates to proteins which contain these amino acid 
sequences or parts of these sequences. 
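
The relationship between a precursor sequence and the mature protein described above can be sketched in Python as follows. The helper name `mature_protein` is hypothetical, and the fragment used is an assumed reading of the start of Fig. 2(a), chosen only to show the 1-based position arithmetic (amino acid 23 is the first residue of the mature chain in Figs. 2 and 4).

```python
def mature_protein(precursor: str, first_mature_residue: int) -> str:
    """Return the mature chain, given the 1-based position of its first residue."""
    if not 1 <= first_mature_residue <= len(precursor):
        raise ValueError("start position lies outside the precursor sequence")
    return precursor[first_mature_residue - 1:]


# Illustrative N-terminal fragment (assumed reading of the start of Fig. 2(a)).
precursor_fragment = "NYRAFALALLFCALSCQAAAAAYAPVPAKADFLGCLMKEIPARLLYA"
print(mature_protein(precursor_fragment, 23))  # YAPVPAKADFLGCLMKEIPARLLYA
```

For the sequences of Fig. 6, the same call would use 22 as the start position.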

The invention accordingly also relates to a process for the preparation of 
such polypeptides by cultivation of a host organism and isolation of the 
corresponding polypeptide from the culture. 

The invention likewise relates to the use of at least one polypeptide de- 
scribed above for the preparation of a medicament for the diagnosis and/or 
treatment of allergies in the triggering of which group 4 allergens from the 
Poaceae, preferably Triticeae, in particular Sec c 4, Hor v 4, Tri a 4, are 
involved and for the prevention of such allergies. 

When determining the protein and DNA sequences according to the inven- 
tion, the following procedure was followed: 






Sec c 4 from rye 

1. Starting from the DNA sequence of Phl p 4 (PCT/EP03/06092), specific
primers (Table 1) derived from the Phl p 4 sequence were generated. Five
clones were obtained from rye pollen DNA by PCR with primers #87 and
#83. The amplified Sec c 4 gene fragment 1 corresponding to these clones
encodes a polypeptide corresponding to amino acids 68-401 of Phl p 4
(Fig. 8).





2. An EST database search was carried out with the partial Sec c 4
sequence. However, no homologous sequences were found in EST databases
specialising in rye. Instead, individual, homologous, non-overlapping
EST fragments were found in EST databases specialising in barley and
wheat. Individual EST fragments extend into the 5'-UTR region and others
into the 3'-UTR region (UTR = untranslated region) of the corresponding
genes.

3. However, a complete group 4 gene from wheat or barley cannot be
constructed from the EST sequences found in the databases since these
sequences do not overlap and a homologous group 4 gene is not known.

However, it was possible to assign these EST sequences with reference to
the Phl p 4 sequence (Fig. 7) and the Sec c 4 fragment obtained in step 1,
and these served as template for the preparation of PCR primers.
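
The idea of assigning non-overlapping fragments with reference to a known sequence can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the procedure actually used: it slides each fragment along a reference without gaps and keeps the offset with the highest identity. The function names and all sequences in the example are hypothetical toy data.

```python
from __future__ import annotations


def best_ungapped_offset(reference: str, fragment: str) -> tuple[int, float]:
    """Slide the fragment along the reference; return (offset, identity) of the best fit."""
    if not fragment or len(fragment) > len(reference):
        raise ValueError("fragment must be non-empty and no longer than the reference")
    best_offset, best_identity = 0, -1.0
    for offset in range(len(reference) - len(fragment) + 1):
        window = reference[offset:offset + len(fragment)]
        identity = sum(a == b for a, b in zip(window, fragment)) / len(fragment)
        if identity > best_identity:
            best_offset, best_identity = offset, identity
    return best_offset, best_identity


def order_fragments(reference: str, fragments: dict[str, str]) -> list[tuple[str, int, float]]:
    """Sort fragments by their best-matching position on the reference."""
    placed = [(name, *best_ungapped_offset(reference, seq)) for name, seq in fragments.items()]
    return sorted(placed, key=lambda item: item[1])


# Toy data (hypothetical): a made-up reference and two non-overlapping,
# slightly divergent fragments standing in for EST sequences.
reference = "ATGGCGCTCCTCTTCTGCGCCTTGTCCTGCCACGCCGCCTACGCGCCCGTGCCTGCCAAG"
fragments = {
    "est_b": "TACGCGCCAGTGCCTGCC",  # one mismatch, fits near the 3' end
    "est_a": "GCGCTCCTGTTCTGC",     # one mismatch, fits near the 5' end
}
for name, offset, identity in order_fragments(reference, fragments):
    print(name, offset, f"{identity:.2f}")
```

Once the fragments are ordered in this way, primers can be placed in the outermost fragments so that the amplified product spans the region between them.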

4. With the aid of primers #195 and #189 prepared in this way, three clones
were obtained by PCR. Primer #195 was derived from a barley EST
sequence; primer #189 is a Phl p 4-specific primer and overlaps the Phl p 4
stop codon and the codons of the 10 C-terminal Phl p 4 amino acids. The
Sec c 4 gene fragment 2 amplified in this way encodes a polypeptide
beginning within the signal sequence and ending with the position
corresponding to position 490 of Phl p 4. This polypeptide covers the
N terminus of Sec c 4.



5a. Three further clones were obtained by PCR with primers #195 and
#202. Both primers were derived from barley EST sequences. The amplified
Sec c 4 gene 3 encodes the corresponding amino acids beginning
within the signal sequence and ending at the C terminus of Sec c 4.
The complete sequence of mature Sec c 4 is thus present in the sequence
determined.



The next two steps 5b and 5c serve to double-check the result obtained in 
step 5a: 



5b. A further clone was obtained by PCR with primers #195 and #203.
Primer #195 was derived from a barley EST sequence, primer #203 from a
wheat EST sequence. The amplified Sec c 4 gene encodes the corresponding
amino acids beginning within the signal sequence and ending at
the C terminus of Sec c 4. The complete sequence of mature Sec c 4 is
therefore present in the sequence determined.



5c. A further clone was obtained by PCR with primers #195 and #198. The
amplified Sec c 4 gene encodes the corresponding amino acids beginning
within the signal sequence and ending at the C terminus of Sec c 4. The
complete sequence of mature Sec c 4 is therefore present in the sequence
determined.



Two isoforms Sec c 4.01 and 4.02 were found. The mature allergens begin 
at the points indicated in Figs. 1 and 2. 
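
The amplification steps above can be mimicked in silico. The following Python sketch is a simplified illustration that assumes exactly matching primers: it returns the stretch of a template bounded by a forward primer and the reverse complement of a reverse primer. The function names are hypothetical, and the template is invented for the example; only the two primer sequences are taken from Table 1.

```python
from typing import Optional

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the opposite strand, read 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def in_silico_pcr(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the amplicon for exactly matching primers, or None if either fails to bind."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer appears on the top strand as its reverse complement.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]


# Hypothetical template assembled around two Table 1 primers; the flanking
# and internal bases are invented for the example.
forward = "GCTCTCGATCGGCTACAATGGCG"    # primer #0195
reverse = "CACGCACTACAAATCTCCATGCAAG"  # primer #0198
template = "GGATCC" + forward + "ACGTACGTAC" + reverse_complement(reverse) + "TTAA"
amplicon = in_silico_pcr(template, forward, reverse)
print(len(amplicon), amplicon)
```

Real primers tolerate some mismatches, which is why primers derived from barley or wheat EST sequences could be used on rye DNA; the exact-match search here does not model that.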



Hor v 4 from barley

With the aid of the Sec c 4 sequences obtained as described above, homo- 
logous EST fragments were found in EST databases of Hordeum vulgare. 
These fragments overlap, but not to give a complete gene. With reference 
to the EST sequences found, however, it was possible to generate Hor v 
4-specific primers, which were used for amplification of the Hor v 4 gene 
from genomic DNA. 

In total, 15 clones were analysed. 

4 clones were obtained by PCR with primers #195 and #198.
4 clones were obtained by PCR with primers #195 and #202.
3 clones were obtained by PCR with primers #194 and #198.
4 clones were obtained by PCR with primers #194 and #202.

The derived protein sequence begins within the signal sequence of Hor v 4
and extends to the C-terminal end of the protein.

Tri a 4 from wheat 

With the aid of the Sec c 4 sequences obtained as described above, homo- 
logous EST fragments were found in EST databases of Triticum aestivum. 
These fragments overlap, but not to give a complete gene. With reference 
to the EST sequences found, however, it was possible to generate the Tri a 
4-specific primers #199, #203, #204 and #206, which were used for 
amplification of the Tri a 4 gene from genomic DNA. 

In total, 13 clones were analysed. 

4 clones were obtained by PCR with primers #204 and #203.
4 clones were obtained by PCR with primers #204 and #199.
3 clones were obtained by PCR with primers #206 and #203.
4 clones were obtained by PCR with primers #206 and #199.
The derived protein sequences begin within the signal sequence of Tri a 4
and extend to the C-terminal end of the protein.
Two variants Tri a 4.01 and Tri a 4.02 were found.

In order to prepare the recombinant allergens according to the invention,
the DNA sequences in accordance with Figs. 1, 3 and 5 were incorporated
into expression vectors (for example pProEx, pSE 380). E. coli-optimised
codons were used for the N-terminal amino acids known from the protein
sequencing.

After transformation in E. coli, expression and purification of the
recombinant allergens according to the invention by various separation
techniques, the proteins obtained were subjected to a refolding process.
Both allergens can be employed for highly specific diagnosis of grass
pollen allergies. This diagnosis can be carried out in vitro by detection of
specific antibodies (IgE, IgG1-4, IgA) and reaction with IgE-loaded effector
cells (for example basophils from blood) or in vivo by skin test reactions
and provocation at the reaction organ.

The reaction of the allergens according to the invention with T-lymphocytes
from grass pollen allergy sufferers can be detected by allergen-specific
stimulation of the T-lymphocytes for proliferation and cytokine synthesis
both with T-cells in freshly prepared blood lymphocytes and also on
established nSec c 4, nHor v 4 or nTri a 4-reactive T-cell lines and clones.

The triplets encoding the cysteines were modified by site-specific
mutagenesis in such a way that they encode other amino acids, preferably
serine. Both variants in which individual cysteines have been replaced and
those in which various combinations of 2 cysteine residues or all cysteines
have been modified were prepared. The expressed proteins of these
cysteine point mutants have greatly reduced or zero reactivity with IgE
antibodies from allergy sufferers, but react with the T-lymphocytes from these
patients.



The present invention therefore furthermore relates to a DNA molecule
described above or below in which one, a plurality of or all the cysteine
residues of the corresponding polypeptide have been replaced with another
amino acid by site-specific mutagenesis.
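
At the sequence level, such a cysteine replacement amounts to exchanging in-frame cysteine codons for the codon of another amino acid. The following Python sketch illustrates this on a short in-frame stretch read from Fig. 1(a). The function name, the `positions` parameter and the choice of TCC as the serine codon are assumptions made for the example; the source does not state which replacement codon was used.

```python
def replace_cysteine_codons(coding_dna: str, positions=None, new_codon: str = "TCC") -> str:
    """Swap in-frame cysteine codons (TGT/TGC) for another codon (default: serine TCC).

    positions: optional set of 1-based residue numbers to mutate; None mutates all.
    """
    coding_dna = coding_dna.upper()
    codons = [coding_dna[i:i + 3] for i in range(0, len(coding_dna), 3)]
    for index, codon in enumerate(codons, start=1):
        if codon in ("TGT", "TGC") and (positions is None or index in positions):
            codons[index - 1] = new_codon
    return "".join(codons)


# In-frame stretch from Fig. 1(a) encoding S-A-V-V-C-G-R (compare Fig. 2(a)).
fragment = "TCCGCGGTGGTGTGCGGCCGC"
print(replace_cysteine_codons(fragment))  # TCCGCGGTGGTGTCCGGCCGC
```

Passing a set of residue numbers as `positions` models the variants in which only individual cysteines or selected combinations are replaced.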



The immunomodulatory activity of hypoallergenic fragments which correspond
to polypeptides having T-cell epitopes and that of the hypoallergenic
point mutants (for example cysteine replacements) can be detected by their
reaction with T-cells from grass pollen allergy sufferers.

Such hypoallergenic fragments or point mutants of the cysteines can be 
employed as preparations for hyposensitisation of allergy sufferers since 
they react with the T-cells with equal effectiveness, but result in reduced 
IgE-mediated side effects owing to the reduced or entirely absent IgE reac- 
tivity. 



If the nucleic acids encoding the hypoallergenic allergen variants according
to the invention or the unmodified DNA molecules according to the invention
are ligated with a human expression vector, these constructs can
likewise be used as preparations for immunotherapy (DNA vaccination).

Finally, the present invention relates to pharmaceutical compositions
comprising at least one DNA molecule described above or at least one
expression vector described above and optionally further active ingredients and/or
adjuvants for the immunotherapeutic DNA vaccination of patients with
allergies in the triggering of which group 4 allergens from the Poaceae,
preferably Triticeae, in particular Sec c 4, Hor v 4, Tri a 4, are involved
and/or for the prevention of such allergies.

A further group of pharmaceutical compositions according to the invention
comprises at least one polypeptide described above instead of the DNA
and is suitable for the diagnosis and/or treatment of said allergies.



Pharmaceutical compositions in the sense of the present invention comprise,
as active ingredients, a polypeptide according to the invention or an
expression vector and/or respective pharmaceutically usable derivatives
thereof, including mixtures thereof in all ratios. The active ingredients
according to the invention can be brought into a suitable dosage form here
together with at least one solid, liquid and/or semi-liquid excipient or
adjuvant and optionally in combination with one or more further active
ingredients.

Particularly suitable adjuvants are immunostimulatory DNA or
oligonucleotides having CpG motifs.



These compositions can be used as therapeutic agents or diagnostic
agents in human or veterinary medicine. Suitable excipients are organic or
inorganic substances which are suitable for parenteral administration and
do not adversely affect the action of the active ingredient according to the
invention. Suitable for parenteral use are, in particular, solutions, preferably
oil-based or aqueous solutions, furthermore suspensions, emulsions or
implants. The active ingredient according to the invention may also be
lyophilised and the resultant lyophilisates used, for example, for the
preparation of injection preparations. The compositions indicated may be
sterilised and/or comprise adjuvants, such as preservatives, stabilisers and/or
wetting agents, emulsifiers, salts for modifying the osmotic pressure, buffer
substances and/or a plurality of further active ingredients.
Furthermore, sustained-release preparations can be obtained by
corresponding formulation of the active ingredient according to the invention - for
example by adsorption on aluminium hydroxide.



The invention thus also serves for improving in vitro diagnosis as part of
allergen component-triggering identification of the patient-specific sensiti- 
sation spectrum. The invention likewise serves for the preparation of sig- 
nificantly improved preparations for the specific immunotherapy of grass 
pollen allergies. 



Table 1: Primers used

a) Sec c 4

Primer number   Sequence
#0083           GGCTCCCGGGGCGAACCAGTAG
#0087           ACCAACGCCTCCCACATCCAGTC
#0189           GATAAGCTTCTCGAGTGATTAGTACTTTTTGATCAGCGGCGGGATGCTC
#0195           GCTCTCGATCGGCTACAATGGCG
#0198           CACGCACTACAAATCTCCATGCAAG
#0202           CATGCTTGATCCTTATTCTACTAGTTGGGC
#0203           TACGCACGATCCTTATTCTACTAGTTGGGC

b) Hor v 4

Primer number   Sequence
#0194           GCCTTGTCCTGCCACCACGCGGCCGCCACC
#0195           GCTCTCGATCGGCTACAATGGCG
#0198           CACGCACTACAAATCTCCATGCAAG
#0202           CATGCTTGATCCTTATTCTACTAGTTGGGC

c) Tri a 4

Primer number   Sequence
#0199           CACGCACTAAATCTCCATGCAAG
#0203           TACGCACGATCCTTATTCTACTAGTTGGGC
#0204           AAGCTCTATCGCCTACAATGGCG
#0206           GGTGCTCCTCTTCTGCGCCTTGTCC
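
Basic properties of the primers listed in Table 1 can be checked with a few lines of Python. The sketch below is illustrative only: the helper names are hypothetical, and just two of the listed primers are used. It reports length, GC fraction and the reverse complement, the last being the form in which a reverse primer appears on the coding strand.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def reverse_complement(primer: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return primer.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Two entries copied from Table 1.
primers = {
    "#0195": "GCTCTCGATCGGCTACAATGGCG",
    "#0198": "CACGCACTACAAATCTCCATGCAAG",
}
for number, sequence in primers.items():
    print(number, len(sequence), f"GC={gc_fraction(sequence):.2f}", reverse_complement(sequence))
```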



Patent Claims

1. A DNA molecule corresponding to a nucleotide sequence selected from
one of the sequences depicted in Figures 1, 3 and 5.

2. A DNA molecule which hybridises with a DNA molecule according to 
Claim 1 under stringent conditions and originates from DNA sequences 
from Poaceae species. 

3. A DNA molecule, encoding a polypeptide, which cross-reacts
immunologically with the major allergens Sec c 4, Hor v 4 or Tri a 4 from Secale
cereale, Hordeum vulgare or Triticum aestivum and originates from DNA
sequences from Poaceae species.

4. A DNA molecule, corresponding to a partial sequence or a combination
of partial sequences according to one or more of Claims 1 to 3, which
encodes an immunomodulatory, T-cell-reactive fragment of a group 4
allergen from the Poaceae.

5. A DNA molecule, corresponding to a nucleotide sequence according to 
one or more of Claims 1 to 4, encoding an immunomodulatory T-cell- 
reactive fragment, characterised in that said nucleotide sequence has 
been specifically modified by specific mutation of individual codons, 
elimination or addition. 

6. A DNA molecule according to Claim 5, characterised in that the said 
mutation results in the replacement of one, a plurality of or all cysteines 
of the corresponding polypeptide with another amino acid. 



7. A recombinant DNA expression vector or a cloning system comprising a
DNA molecule according to one or more of Claims 1 to 6, functionally
linked to an expression control sequence.



8. A host organism transformed with a DNA molecule according to one or 
more of Claims 1 to 6 or an expression vector according to Claim 7. 



9. A process for the preparation of a polypeptide encoded by a DNA
sequence according to one or more of Claims 1 to 6 by cultivation of a
host organism according to Claim 8 and isolation of the corresponding
polypeptide from the culture.



10. A polypeptide corresponding to one of the amino acid sequences
depicted in Figures 2, 4 and 6, which is encoded by a DNA sequence
according to one or more of Claims 1 to 6.

11. A polypeptide in accordance with one of the amino acid sequences
depicted in Figs. 2, 4 and 6, corresponding to the mature allergen, 
selected from the following group of amino acid sequences 

- one of the amino acid sequences depicted in Figures 2 and 4 
beginning with amino acid 23, 

- one of the amino acid sequences depicted in Fig. 6 beginning with 
amino acid 22. 



12. A polypeptide according to Claim 10 or 11 as medicament. 



13. A pharmaceutical composition comprising at least one polypeptide
according to Claim 12 and optionally further active ingredients and/or
adjuvants for the diagnosis and/or treatment of allergies in the triggering
of which group 4 allergens from the Poaceae are involved.



14. Use of at least one polypeptide according to Claim 12 for the
preparation of a medicament for the diagnosis and/or treatment of allergies in
the triggering of which group 4 allergens from the Poaceae are involved
and/or for the prevention of such allergies.

15. A DNA molecule according to one or more of Claims 1 to 6 as medica- 
ment. 


16. A recombinant expression vector according to Claim 7 as medicament. 

17. A pharmaceutical composition comprising at least one DNA molecule 
according to Claim 15 or at least one expression vector according to 
Claim 16 and optionally further active ingredients and/or adjuvants for
the immunotherapeutic DNA vaccination of patients with allergies in the 
triggering of which group 4 allergens from the Poaceae are involved 
and/or for the prevention of such allergies. 

18. Use of at least one DNA molecule according to Claim 15 or at least one
expression vector according to Claim 16 for the preparation of a
medicament for the immunotherapeutic DNA vaccination of patients with
allergies in the triggering of which group 4 allergens from the Poaceae
are involved and/or for the prevention of such allergies.

Abstract

The present invention relates to the provision of DNA sequences of group 4 
major allergens from cereals. The invention also encompasses fragments, 
new combinations of partial sequences and point mutants having a 
hypoallergenic action. The recombinant DNA molecules and the derived 
polypeptides, fragments, new combinations of partial sequences and vari- 
ants can be utilised for the therapy of pollen-allergic diseases. The proteins 
prepared by recombinant methods can be employed for in vitro and in vivo 
diagnosis of pollen allergies. 

Figure 1(a)

AACTATAGGGCCTTCGCGCTGGCGCTCCTCTTCTGCGCCTTGTCCTG
CCAAGCCGCCGCGGCCGCCT ACGCGCCCGTGCCTGCCAAGGCGGACT
TCCTCGGATGCCTCATGAAGGAGATACCGGCCCGCCTCCTCTACGCC
AAGAGCTCGCCTGACTACCCCACCGTGCTGGCGCAGACCATCAGGAA
CTCGCGGTGGTCGTCGCCGCAGAACGTGAAGCCGATCTACATpATCA
CCCCCACCAACGCCTCGCACATCCAGTCCGCGGTGGTGTGCGGCCGC
CGGCACGGCATCCGCCTCCGCGTGCGGAGCGGCGGCCACGACTACGA
GGGCCTGTCGTACCGGTCTGAGAAACCCGAGACGTTCGCCGTCGTCG
ACCTCAACAAGATGCGGGCAGTGTCGGTCGACGGCTACGCCCGCACG
GCGTGGGTCGAATCCGGCGCGCAGCTCGGCGAGCTCTACTACGCGAT
CGCCAAGAACAGCCCCGTGCTCGCGTTCCCGGCTGGCGTCTGCCCGT
CCATCGGCGTCGGCGGCAACTTCGCAGGCGGCGGCTTTGGCATGCTG
CTGCGCAAGTACGGCATCGCCGCTGAGAACGTCATCGACGTCAAGGT
GGTCGACCCCAACGGCAAGCTGCTCGACAAGAGCTCCATGAGCGCGG
ACCACTTCTGGGCCGTTAGGGGCGGCGGCGGAGAGAGCTTTGGCATC
GTCGTCTCGTGGCAGGTGAAGCTCCTGCCGGTGCCTCCCACCGTGAC
CGTGCTCAAGATCCCCAAGACGGTGCAAGAAGGCGCCATAGACCTCG
TCAACAAGTGGCAGCTGGTCGGGCCGGCACTTCCCGGCGACCTCATG
ATCCGCATCATCCTTGCCGGGAACAGCGCGACGTTCGAGGCCATGTA
CCTGGGCACCTGCAGTACCCTGACGCCGCTGATGAGCAGCAAATTCC
CCGAGCTTGGCATGAACCCCTCGCACTGCAACGAGATGTCCTGGATC
AAGTCCATCCCCTTCATCCACCTCGGCAAGCAGAACCTCGACGACCT
CCTCAACCGGAACAACACCTTCAAACCATTCGCCGAATACAAGTCGG
ACTACGTGTACCAGCCCTTCCCCAAGCCCGTGTGGGAGCAGATCTTC
GGCTGGCTTGTGAAGCCCGGCGCGGGGATCATGATCATGGACCCCTA
TGGCGCCACCATCAGCGCTACCCCCGAAGCGGCGACGCCGTTCCCTC 
ACCGCCAGGGCGTCCrCTTCAACATCCAGTACGTCAACTACTGGTTC 
GCTGAGTCAGCCGGCGCGGCGCCGCTGCAGTGGAGCAAGGACATATA 
CAAGTTCATGGAGCCGTACGTGAGCAAAAATCCCAGGCAGGCGTATG 
CCAACTACAGGGACATCGACCTTGGCAGGAATGAGGTGGTGAACGAC 
ATCTCCACCTACAGCAGCGGCAAAGTGTGGGGTGAGAAGTACTTCAA 
GGGCAACTTCCAAAGGCTCGCCATTACCAAGGGCAAGGTGGATCCTC
AGGACTACTTCAGGAACGAGCAGAGCATCCCGCCACTGGTCGAGAAG 
TACTGATCGAGGACCTTGCATGGAAATTTAGTGCGTGGTTGGCGTTT 
CACAT 

Figure 1(b) 

AACTCGAGGGCCTTTGCTCTGGTGCCCCTCCTCATCTGCGTCTTGTC 
CrGCCACGCCGCCGrCTCC TACGCGGCGGCGCCGGTGCCGGCCAAGG 
AGGACTTCTTCGGATGCCTGGTGAAGGAGATACCGGCCCGCCTCCTC 
TACGCCAAGAGCTCGCCTGCCTTCCCCACCGTCCTGGCGCAGACCAT 
CAGGAACTCGCGGTGGTCGTCGCCGCAGAGCGTGAAGCCGCTCTACA 
TCATCACCCCCACCAACGCCTCCCACATCCAGTCCGCGGTGGTGTGC 
GGCCGCCGGCACGGCGTCCGCATCCGCGTGCGGAGCGGCGGCCACGA 
CTACGAGGGCCTGTCGTACCGGTCCGAGCGCCCCGAGGCGTTCGCCG 
TCGTCGACCTCAACAAGATGCGGGCCGTGGTGGTCGACGGCAAGGCT 
CGCACGGCGTGGGTGGACTCCGGTGCGCAGCTCGGCGAGCTCTACTA 
CGCCATCGCCAAGAACAGCCCCGTGCTCGCGTTCCCGGCCGGCGTTT 
GCCCGACCATTGGTGTAGGCGGCAACTTCGCTGGCGGCGGCTTCGGC 
ATGCTGCTGCGCAAGTACGGCATCGCCGCCGAGAACGTCATCGACGT 
GAAGGTGGTCGACGCCAACGGCACACTGCTCGACAAGAGCTCCATGA 
GCGCGGATCACTTCTGGGCCGTCAGGGGCGGCGGCGGAGAGAGCTTC
GGCATCGTCGTGTCGTGGCAGGTGAAGCTCCTCCCGGTGCCTCCCAC 
CGTGACCGTGTTCAAGATCCCCAAGACGGTGCAAGAAGGCGCCGTAG 
AGCTCATCAACAAGTGGCAGCTAGTCGCGCCGGCCCTCCCCGACGAC 
CTGATGATCCGCATCATCGCTTTCGGCGGCACCGCCAAGTTCGAGGC 
CATGTACCTGGGCACCTGCAAAGCCCTGACACCGCTGATGAGCAGCA 
GATTCCCCGAGCTCGGCATGAACGCCTCGCACTGCAACGAGATGCCC 
TGGATCAAGTCCGTCCCATTCATCCACCTTGGCAAGCAGGCCACCCT 
CTCCGACCTCCTCAACCGGAACAACACCTTCAAACCCTTCGCCGAGT 
ACAAGTCGGACTACGTCTACCAGCCCGTCCCCAAGCCCGTCTGGGCG 
CAGATCTTCGTCTGGCTCGTCAAACCCGGCGCCGGGATCATGGTCAT 

GGACCCCTACGGCGCCGCCATCAGCGCCACCCCCGAAGCCGCCACGC
CGTTCCCTCACCGCAAGGACGTCCTCTTCAACATCCAGTACGTCAAC
TACTGGTTCGACGAGGCAGGCGGCGCCGCGCCGCTGCAGTGGAGCAA
GGACATGTACAGGTTCATGGAGCCGTACGTCAGCAAGAACCCCAGAC
AGGCCTACGCCAACTACAGGGACATCGACCTCGGCAGGAACGAGGTG 
GTCAACGACATCTCCACCTATGCCAGCGGCAAGGTCTGGGGCGAGAA 
GTACTTCAAGGGCAACTTCCAAAGGCTCGCCATTACCAAGGGCAAGG
TGGATCCTCAGGACTACTTCAGGAACGAGCAGAGCATCCCGCCGCTG 
CTAGGGAAG TAGT AGTACTCTTGCTTGCATGGAGATTTGTAGTGCGT 
CTTTCGCGTTTCAAATGCCCAACTAGTAGAATAAGGATCGTGCGTA 



Figure 2(a) 

NYRAFALALLFCALSCQAAAAA YAPVPAKADFLGCLMKEIPARLLYA
KSSPDYPTVLAQTIRNSRWSSPQNVKPIYIITPTNASHIQSAVVCGR
RHGIRLRVRSGGHDYEGLSYRSEKPETFAVVDLNKMRAVSVDGYART
AWVESGAQLGELYYAIAKNSPVLAFPAGVCPSIGVGGNFAGGGFGML
LRKYGIAAENVIDVKVVDPNGKLLDKSSMSADHFWAVRGGGGESFGI
VVSWQVKLLPVPPTVTVLKIPKTVQEGAIDLVNKWQLVGPALPGDLM
IRIILAGNSATFEAMYLGTCSTLTPLMSSKFPELGMNPSHCNEMSWI
KSIPFIHLGKQNLDDLLNRNNTFKPFAEYKSDYVYQPFPKPVWEQIF
GWLVKPGAGIMIMDPYGATISATPEAATPFPHRQGVLFNIQYVNYWF
AESAGAAPLQWSKDIYKFMEPYVSKNPRQAYANYRDIDLGRNEVVND
ISTYSSGKVWGEKYFKGNFQRLAITKGKVDPQDYFRNEQSIPPLVEK



Figure 2(b) 

NSRAFALVPLLICVLSCHAAVSY AAAPVPAKEDFFGCLVKEIPARLL
YAKSSPAFPTVLAQTIRNSRWSSPQSVKPLYIITPTNASHIQSAVVC
GRRHGVRIRVRSGGHDYEGLSYRSERPEAFAVVDLNKMRAVVVDGKA
RTAWVDSGAQLGELYYAIAKNSPVLAFPAGVCPTIGVGGNFAGGGFG
MLLRKYGIAAENVIDVKVVDANGTLLDKSSMSADHFWAVRGGGGESF
GIVVSWQVKLLPVPPTVTVFKIPKTVQEGAVELINKWQLVAPALPDD
LMIRIIAFGGTAKFEAMYLGTCKALTPLMSSRFPELGMNASHCNEMP
WIKSVPFIHLGKQATLSDLLNRNNTFKPFAEYKSDYVYQPVPKPVWA
QIFVWLVKPGAGIMVMDPYGAAISATPEAATPFPHRKDVLFNIQYVN
YWFDEAGGAAPLQWSKDMYRFMEPYVSKNPRQAYANYRDIDLGRNEV
VNDISTYASGKVWGEKYFKGNFQRLAITKGKVDPQDYFRNEQSIPPL
LGK 

Figure 3

AGCTCGAGGGCCTTCGCTCTGGTGCTCCTCCTCTGCGCCTTGTCCTG 
CCACCACGCTGCCGTCTCC TCCGCGCAGGTGCCGGCCAAGGACGACT

TCCTGGGATGCCTCGTGAAGGAGATACCGGCCCGCCTCCTCTTCGCC 
AAGAGCTCGCCTGCCTTCCCCGCCGTCCTGGAGCAGACCATCAGGAA 
CTCGCGGTGGTCGTCGCCGCAGAACGTGAAGCCGCTCTACATCATCA 
CCCCCACCAACACCTCCCACATCCAGTCTGCTGTGGTGTGCGGCCGC 
CGGCACGGCGTCCGCCTCCGCGTGCGGAGCGGCGGCCACGACTACGA 
GGGCCTGTCGTACCGGTCCGAGCGCCCCGAGGCGTTCGCCGTCGTAG 
ACCTCAACAAGATGCGGACCGTGTTGGTCAACGAAAAGGCCCGCACG 
GCGTGGGTGGACTCCGGCGCGCAGCTCGGCGAGCTCTACTACGCCAT 
CGCCAAGAACAGCCCCGTGCTCGCGTTCCCAGCCGGCGTTTGCCCGT 
CCATTGGTGTAGGTGGCAACTTCGCTGGCGGCGGCTTCGGCATGCTG 
CTGCGCAAGTACGGCATCGCCGCCGAGAACGTCATCGACGTCAAGCT 
GGTCGACGCCAACGGCAAGCTGCTCGACAAGAGCTCCATGAGCCCGG 
ACCACTTCTGGGCCGTCAGGGGCGGCGGCGGAGAGAGCTTCGGCATC
GTCGTCTCGTGGCAGGTGAAGCTTCTCCCGGTGCCTCCCACCGTGAC 
TGTGTTTCAGATCCCCAAGACAGTGCAAGAAGGCGCCGTAGACCTCA 
TCAACAAGTGGCAGCTGGTCGCGCCGGCCCTTCCCGGCGACATCATG 
ATCCGCATCATCGCCATGGGGGACAAAGCGACGTTCGAGGCCATGTA
CCTGGGCACCTGCAAAACCCTGACGCCGCTGATGAGCAGCAAATTCC
CGGAGCTTGGCATGAACCCCTCGCACTGCAACGAGATGCCCTGGATC
AAGTCCATCCCCTTCATCCACCTTGGCAAGCAGGCCACCCTGGCCGA
CCTCCTCAACCGGT^CAACACCTTCAAACCCTTCGCCGAATACAAGT
CGGACTACGTCTACCAGCCCGTCCCCAAGCCCGTGTGGGAGCAGCTC 
TTCGGCTGGCTCACGAAACCCGGCGCGGGGATCATGGTCATGGACCC 
ATACGGCGCCACCATCAGCGCCACCCCCGAAGCGGCGACGCCGTTCC 
CTCACCGCAAGGGCGTCCTCTTCAACATCCAGTACGTCAACTACTGG
TTCGCCGAGGCAGCCGGCGCCGCGCCGCTGCAGTGGAGCAAGGACAT
TTACAAATTCATGGAGCCGTTCGTGAGCAAGAACCCCAGGCAGGCGT 
ACGCCAACTACAGGGACATCGACCTCGGCAGGAACGAGGTGGTGAAC 
GACATCTCAACCTACAGCAGCGGCAAGGTGTGGGGCGAGAAGTACTT
CAAGGGCAACTTCCAAAGGCTCGCCATCACCAAGGGCAAGGTGGATC
CCCAGGACTACTTCAGGAACGAGCAGAGCATCCCGCCGCTGCTGGGC 
AAG TAG TGACCGAGAGTCTTGCATGGAGATTTGTAGTGCGTGCTTGG 
CGTTTCTGAT 



Figure 4 

SSRAFALVLLLCALSCHHAAVS SAQVPAKDDFLGCLVKEIPARLLFA
KSSPAFPAVLEQTIRNSRWSSPQNVKPLYIITPTNTSHIQSAVVCGR
RHGVRLRVRSGGHDYEGLSYRSERPEAFAVVDLNKMRTVLVNEKART
AWVDSGAQLGELYYAIAKNSPVLAFPAGVCPSIGVGGNFAGGGFGML
LRKYGIAAENVIDVKLVDANGKLLDKSSMSPDHFWAVRGGGGESFGI
VVSWQVKLLPVPPTVTVFQIPKTVQEGAVDLINKWQLVAPALPGDIM
IRIIAMGDKATFEAMYLGTCKTLTPLMSSKFPELGMNPSHCNEMPWI
KSIPFIHLGKQATLADLLNRNNTFKPFAEYKSDYVYQPVPKPVWEQL
FGWLTKPGAGIMVMDPYGATISATPEAATPFPHRKGVLFNIQYVNYW
FAEAAGAAPLQWSKDIYKFMEPFVSKNPRQAYANYRDIDLGRNEVVN
DISTYSSGKVWGEKYFKGNFQRLAITKGKVDPQDYFRNEQSIPPLLG
K 

Figure 5(a)

AACTATAGGGCCTTCACGCTGGTGCTCCTCTTCTGCGCCTTGTCCTG 
TCAAGCCGCCGCCACC TACGCGCCGGTGCCTGCCAAGGAGGACTTCC
TCGGATGCCTCATGAAGGAGATACCGGCACGCCTCCTCTACGCCAAG 
AGCTCGCCTGACTTCCCCACCGTCCTGGCGCAGACCATCAGGAACTC 
GCGGTGGTTGTCGCCGCAGAACGTGAAGCCGCTCTACATCATCACCC 
CCACCAACGCCTCGCACATCCAGTCCGCGGTGGTGTGCGGACGCCGG 
CACAGCGTCCGCCTCCGCGTCCGGAGCGGCGGCCACGACTACGAGGG 
CCTGTCGTACCGGTCCGAGAAACCCGAGACGTTCGCCGTCGTCGACC 
TCAACAAGATGCGGGCAGTGTTGATCGACGGCTACGCCCGCACGGCG 
TGGGTCGAATCCGGCGCGCAGCTCGGCGAGCTCTACTACGCCATCGC 
GAAAAACAGCCCCGTGCTCGCGTTCCCGGCCGGCGTCTGCCCGACCA 
TCGGCGTCGGCGGCAACTTCGCAGGCGGCGGCTTTGGCATGCTGCTG 
CGGAAGTACGGCATCGCCGCCGAGAACGTCATCGACGTCAAGGTGGT
CGACCCCAACGGCAAGCTTCTCGACAAGAGCTCCATGAGCCCGGACC 
ACTTCTGGGCCGTCAGGGGCGGCGGCGGAGAGAGCTTTGGCATCGTC
GTGTCGTGGCAAGTGAAGCTCCTGCCGGTGCCTCCCACCGTGACCGT 
GTTCAAGATCCCCAAGACAGTGCAAGAAGGCGCCGTAGACCTCGTCA 
ACAAGTGGCAACTGGTCGGGCCGGCCCTTCCCGGCGACCTCATGATC 
CGCGTCATCGCTGCGGGGAACACCGCGACATTCGAGGGCATGTACCT
GGGCACCTGCCAAACCCTGACGCCGTTGATGAGCAGCCAATTCCCCG 
AGCTTGGCATGAACCCCTATCACTGCAACGAGATGCCCTGGATCAAG 
TCCATCCCCTTCATCCACCTCGGCAAAGAGGCCAGCCTGGTCGACCT 
CCTCAACCGGAACAACACCTTCAAGCCCTTCGCCGAATACAAGTCGG
ACTACGTGTACCAGCCCTTCCCCAAGCCCGTGTGGGAGCAGATCTTC 
GGCTGGCTCACGAAGCCCGGTGGGGGGATGATGATCATGGACCCATA 
CGGCGCCACCATCAGCGCCACCCCCGAAGCGGCGACGCCGTTCCCTC 
ACCGCCAGGGCGTTCTCTTCAACATCCAGTACGTCAACTACTGGTTC
GCCGAGGCAGCCGCCGCCGCGCCGCTGCAGTGGAGCAAGGACATGTA
CAATTTCATGGAGCCGTACGTGAGCAAGAACCCCAGGCAGGCGTACG
CCAACTACAGGGACATTGACCTCGGCAGGAACGAGGTGGTGAACGAC 
ATCTCAACCTATAGCAGCGGCAAGGTTTGGGGCGAGAAGTACTTCAA 
GGGCAACTTCCAAAGGCTCGCTATTACCAAGGGCAAGGTGGATCCTC 
AGGACTACTTCAGGAACGAGCAGAGCATCCCGCCGCTGCTCGAGAAG 

TAC TGA TCGAGGACCTTGCATGGAGATTTAGTGCGTGGTTGCCGTTT 
CACAT 

Figure 5(b) 

AACTGTAGGGCCTTCGCGCAGGTGCTCCTCTTCTTCGCCTTGTCCTC 
CCAAGCCGCCGCCACC TACGCGCCGGTGCCTGCCAAGGAGGACTTCC
TCGGATGCCTCATGAAGGAGATACCGGCCCGCCTCCTCTACGCCAAG 
AGCTCGCCTGACTACCCCACCGTGCTGGCGCAGACCATCAGGAACTC 
GCGGTGGTCGACGCAGCAGAACGTGAAGCCGCTGTACATCATCACCC 
CCACCAACGCCTCCCACATCCAATCCGCGGTGGTGTGCGGCCGCCGG 
CACGGCGTCCGCCTCCGCGTGCGGAGCGGCGGCCACGACTACGAGGG 

CCTGTCGTACCGGTCCGAGAAACCCGAGACGTTCGCCGTCGTCGACC 
TCAACAAGATGCGGGCAGTGGTTGTCGACGGCTACGCCCGCACGGCG 
TGGGTCGAATCCGGCGCGCAGCTCGGCGAGCTCTACTACGCCATCGC
GAAGAACAGCCCCGTGCTCGCGTTCCCGGCCGGCGTCTGCCCGTCCA 
TCGGCGTCGGCGGCAACTTCGCAGGCGGCGGCTTCGGCATGCTGCTG 
CGCAAGTACGGCATCGCCGCCGAGAACGTCATCGACGTCAAGGTGGT 
CGACCCCGACGGCAAGCTGCTCGACAAGAGCTCCATGAGCGCGGACC 
ACTTCTGGGCCGTCAGGGGCGGCGGCGGAGAGAGCTTCGGCATCGTC
GTCTCGTGGCAGGTGAAGCTCATGCCAGTGCCTCCCACCGTCACCGT 
GTTTAAGATCCCCAAGACGGTGCAAGAAGGCGCCGTAGACCTCGTCA 
ACAAGTGGCAGCTGGTCGGGCCGGCACTTCCCGGCGACCTCATGATC

CGCGTCATCGCTGCCGGGAACACGGCGACGTTCGAGGCCTTGTACCT 

GGGCACCTGCAAAACCCTGACGCCGCTGATGAGCAGCCAATTCCCCG 

AGCTTGGCATGAACCCCTATCACTGCAACGAGATGCCCTGGATCAAG 

TCCGTCCCCTTCATCCACCTCGGCAAACAGGCTGGCCTGGACGACCT 

CCTCAACCGGAACAACACCTTCAAGCCCTTCGCCGAATACAAGTCGG 

ACTACGTGTACCAGCCCTTCCCCAAGCCCGTGTGGGAGCAGATCTTC 

GGCTGGCTCGCGAAGCCCGGCGCGGGGATCATGATCATGGACCGCTA 

CGGCGCCACCATCAGCGCCACCCCCGAAGCGGCGACGCCGTTCCCTC 

ACCGCCAGGGCGTCCTCTTCAACATCCAGTATGTCAACTACTGGTTC 

GCCGAGCCAGCCGGCGCCGCGCCGCTGCAGTGGAGCAAGGACATTTA 

CAATTTCATGGAGCCGTACGTGAGCAAGAACCCCAGGCAGGCGTACG 

CCAACTACAGGGACATCGACCTCGGCAGGAATGAGGTGGTGAACGAC

ATCTCAACCTACAGCAGCGGCAAGGTGTGGGGCGAGAAGTACTTCAA

GAGCAACTTCCAAAGGCTCGCCATTACCAAGGGCAAGGTAGATCCTC

AGGACTACTTCAGGAATGAGCAAAGCATCCCGCCGCTGATCGAGAAG

TACTGATCGAGGACCTTGCATGGAGATTTAGTGCGTGGTTGGCGTTT 
CACAT 

Figure 6(a) 

NYRAFTLVLLFCALSCQAAATY APVPAKEDFLGCLMKEIPARLLYAK
SSPDFPTVLAQTIRNSRWLSPQNVKPLYIITPTNASHIQSAVVCGRR
HSVRLRVRSGGHDYEGLSYRSEKPETFAVVDLNKMRAVLIDGYARTA
WVESGAQLGELYYAIAKNSPVLAFPAGVCPTIGVGGNFAGGGFGMLL
RKYGIAAENVIDVKVVDPNGKLLDKSSMSPDHFWAVRGGGGESFGIV
VSWQVKLLPVPPTVTVFKIPKTVQEGAVDLVNKWQLVGPALPGDLMI
RVIAAGNTATFEGMYLGTCQTLTPLMSSQFPELGMNPYHCNEMPWIK
SIPFIHLGKEASLVDLLNRNNTFKPFAEYKSDYVYQPFPKPVWEQIF
GWLTKPGGGMMIMDPYGATISATPEAATPFPHRQGVLFNIQYVNYWF
AEAAAAAPLQWSKDMYNFMEPYVSKNPRQAYANYRDIDLGRNEVVND
ISTYSSGKVWGEKYFKGNFQRLAITKGKVDPQDYFRNEQSIPPLLEK
Y 



Figure 6(b) 

NCRAFAQVLLFFALSCQAAAT YAPVPAKEDFLGCLMKEIPARLLYAK
SSPDYPTVLAQTIRNSRWSTQQNVKPLYIITPTNASHIQSAVVCGRR
HGVRLRVRSGGHDYEGLSYRSEKPETFAVVDLNKMRAVVVDGYARTA
WVESGAQLGELYYAIAKNSPVLAFPAGVCPSIGVGGNFAGGGFGMLL
RKYGIAAENVIDVKVVDPDGKLLDKSSMSADHFWAVRGGGGESFGIV
VSWQVKLMPVPPTVTVFKIPKTVQEGAVDLVNKWQLVGPALPGDLMI
RVIAAGNTATFEALYLGTCKTLTPLMSSQFPELGMNPYHCNEMPWIK
SVPFIHLGKQAGLDDLLNRNNTFKPFAEYKSDYVYQPFPKPVWEQIF
GWLAKPGAGIMIMDPYGATISATPEAATPFPHRQGVLFNIQYVNYWF
AEPAGAAPLQWSKDIYNFMEPYVSKNPRQAYANYRDIDLGRNEVVND
ISTYSSGKVWGEKYFKSNFQRLAITKGKVDPQDYFRNEQSIPPLIEK 




Figure 7 

tacttcccgccgccggctgctaaagaagacttcctgggttgcctggt 
taaagaaatcccgccgcgtctgttgtacgcgaaatcgtcgccggcgt 
atccctcagtcctggggcagaccatccggaactcgaggtggtcgtcg 
ccggacaacgtgaagccgctctacatcatcacccccaccaacgtctc 
ccacatccagtccgccgtggtgtgcggccgccgccacagcgtccgca
tccgcgtgcgcagcggcgggcacgactacgagggcctctcgtaccgg 
tctttgcagcccgagacgttcgccgtcgtcgacctcaacaagatgcg 
ggcggtgtgggtggacggcaaggcccgcacggcgtgggtggactccg 
gcgcgcagctcggcgagctctactacgccatctataaggcgagcccc 
acgctggcgttcccggccggcgtgtgcccgacgatcggagtgggcgg 
caacttcgcgggcggcggcttcggcatgctgctgcgcaagtacggca 
tcgccgcggagaacgtcatcgacgtgaagctcgtcgacgccaacggc
aagctgcacgacaagaagtccatgggcgacgaccatttctgggccgt 
caggggcggcgggggcgagagcttcggcatcgtggtcgcgtggcagg 
tgaagctcctgccggtgccgcccaccgtgacaatattcaagatctcc 
aagacagtgagcgagggcgccgtggacatcatcaacaagtggcaagt 
ggtcgcgccgcagcttcccgccgacctcatgatccgcatcatcgcgc 
aggggcccaaggccacgttcgaggccatgtacctcggcacctgcaaa 
accctgacgccgttgatgagcagcaagttcccggagctcggcatgaa 
cccctcccactgcaacgagatgtcatggatccagtccatccccttcg 
tccacctcggccacagggacgccctcgaggacgacctcctcaaccgg 
aacaactccttcaagcccttcgccgaatacaagtccgactacgtcta 
ccagcccttccccaagaccgtctgggagcagatcctcaacacctggc 
tcgtcaagcccggcgccgggatcatgatcttcgacccctacggcgcc 
accatcagcgccaccccggagtccgccacgcccttccctcaccgcaa 
gggcgtcctcttcaacatccagtacgtcaactactggttcgccccgg 
gagccgccgccgcgcccctctcgtggagcaaggacatctacaactac 
atggagccctacgtgagcaagaaccccaggcaggcgtacgcaaacta
cagggacatcgacctcggcaggaacgaggtggtcaacgacgtctcca 
cctacgccagcggcaaggtctggggccagaaatacttcaagggcaac 
ttcgagaggctcgccattaccaagggcaaggtcgatcctaccgacta 
cttcaggaacgagcagagcatcccgccgctcatcaaaaagtactga 
Figure 8

YFPPPAAKEDFLGCLVKEIPPRLLYAKSSPAYPSVLGQTIRNSRWSS 
PDNVKPLYIITPTNVSHIQSAVVCGRRHSVRIRVRSGGHDYEGLSYR
SLQPETFAVVDLNKMRAVWVDGKARTAWVDSGAQLGELYYAIYKASP
TLAFPAGVCPTIGVGGNFAGGGFGMLLRKYGIAAENVIDVKLVDANG
KLHDKKSMGDDHFWAVRGGGGESFGIVVAWQVKLLPVPPTVTIFKIS
KTVSEGAVDIINKWQVVAPQLPADLMIRIIAQGPKATFEAMYLGTCK 
TLTPLMSSKFPELGMNPSHCNEMSWIQSIPFVHLGHRDALEDDLLNR 
NNSFKPFAEYKSDYVYQPFPKTVWEQILNTWLVKPGAGIMIFDPYGA 
TISATPESATPFPHRKGVLFNIQYVNYWFAPGAAAAPLSWSKDIYNY 
MEPYVSKNPRQAYANYRDIDLGRNEVVNDVSTYASGKVWGQKYFKGN
FERLAITKGKVDPTDYFRNEQSIPPLIKKY 